Visual Speech Recognition for Kannada Language Using VGG16 Convolutional Neural Network

نویسندگان

چکیده

Visual speech recognition (VSR) is a method of reading by noticing the lip actions narrators. significantly depends on visual features derived from image sequences. stimulating process that poses various challenging tasks to human machine-based procedures. VSR methods clarify using machine learning. helps people who are hearing impaired, laryngeal patients, and in noisy environment. In this research, authors developed our dataset for Kannada Language. The contained five words, which Avanu, Bagge, Bari, Guruthu, Helida, these words randomly chosen. average duration each video 1 s 1.2 s. learning used feature extraction classification. Here, applied VGG16 Convolution Neural Network custom dataset, relu activation function get an accuracy 91.90% recommended system confirms effectiveness system. proposed output compared with HCNN, ResNet-LSTM, Bi-LSTM, GLCM-ANN, evidenced

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

EMG-based wrist gesture recognition using a convolutional neural network

Background: Deep learning has revolutionized artificial intelligence and has transformed many fields. It allows processing high-dimensional data (such as signals or images) without the need for feature engineering. The aim of this research is to develop a deep learning-based system to decode motor intent from electromyogram (EMG) signals. Methods: A myoelectric system based on convolutional ne...

متن کامل

Speech Command Recognition with Convolutional Neural Network

This project aims to build an accurate, smallfootprint, low-latency Speech Command Recognition system that is capable of detecting predefined keywords. Using the Speech Commands Dataset provided by Google’s TensorFlow and AIY teams, we have implemented different architectures using different machine learning algorithms. Our models include: Vanilla Single-Layer softmax model, Deep Neural Network...

متن کامل

Deep Recurrent Convolutional Neural Network: Improving Performance For Speech Recognition

A deep learning approach has been widely applied in sequence modeling problems. In terms of automatic speech recognition (ASR), its performance has significantly been improved by increasing large speech corpus and deeper neural network. Especially, recurrent neural network and deep convolutional neural network have been applied in ASR successfully. Given the arising problem of training speed, w...

متن کامل

Neural Network based Kannada Numerals Recognition System

This paper presents a novel approach for feature extraction in spatial domain to recognize segmented (isolated) Kannada numerals using artificial neural networks. Artificial neural systems represent the promising new generation of information processing networks to develop intelligent machines which can be used as classifier. The ability of neural networks to learn by ordinary experience, as we...

متن کامل

Vehicle Color Recognition using Convolutional Neural Network

Vehicle color information is one of the important elements in ITS (Intelligent Traffic System). In this paper, we present a vehicle color recognition method using convolutional neural network (CNN). Naturally, CNN is designed to learn classification method based on shape information, but we proved that CNN can also learn classification based on color distribution. In our method, we convert the ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Acoustics

سال: 2023

ISSN: ['2624-599X']

DOI: https://doi.org/10.3390/acoustics5010020